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CYSTEINE PROTEASE PROMOTER FROM OIL SEED RAPE AND A METHOD FOR THE 
CONTAINMENT OF PLANT GERMPLASM 

The present invention relates to promoters and to a construct comprising the same. 
The present invention also relates to a method for the containment of plant germplasm. 

In particular, the present invention relates to the use of a promoter for the expression 
of a gene of interest (GOI) in a specific tissue or tissues of a plant. 

More particularly, the present invention relates to promoters for cysteine proteases. 
The present invention also relates to the application of these cysteine protease promoters to 
express a GOI in a specific tissue or tissues of a plant. 

Promoters control the spatial and temporal expression of genes by modulating their 
level of transcription. Early approaches to genetically engineered crop plants utilised strong 
constitutive promoters to drive the expression of foreign genes. As strategies in plant 
biotechnology have become more sophisticated, there are requirements for specific promoters 
to target transgene expression to a particular tissue or to a particular developmental stage. 

Cysteine proteases are members of a large multigene family in plants (Praekelt et al, 
1988; Goetting-Minesky and Mullin, 1994), animals (Wiederanders et al, 1992) and protozoa 
(Mallinson et al 1994). Cysteine proteases are synthesised as an inactive precursor (Praekelt 
et al, 1988). The pre-pro-enzyme is targeted to the secretory pathway (Marttila et al ,1995) 
and post-transcriptionally processed in the vacuoles by proteolytic cleavage of the propeptide 
fragment to produce the active enzyme (Hara-Nishimura et al, 1993 and 1994). 

Plant cysteine proteases participate in different metabolic events of physiological 
importance. During seed germination and plant senescence they are involved in protein 
degradation (Jones et al, 1995; Valpuesta et al, 1995; Smart et al, 1995) and play a key role 
in protein storage mobilisation during germination (Boylan and Sussex, 1987). During seed 
development, cysteine proteases catalyse the post-translational processing of protein 
precursors into their mature form (Hara-Nishimura et al, 1995). In addition, some are 
subjected to hormonal regulation either by giberellic acid (Koehler and Ho, 1990; Watanabe 
et al, 1991) or ethylene (Cervantes et al, 1994; Jones et al, 1995). Others are induced in 
response to stress like wounding (Linthorst et al, 1993; Lidgett et al, 1995), dehydration 
(Guerrero et al, 1990), cold (Schaffer and Fischer, 1988) or are implicated in plant-microbe 
interactions (Goetting-Minesky and Mullin, 1994). 
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Gennination specific cysteine proteases have been characterised for barley (Marttila et 
al., 1995), rice (Watanabe et al, 1991), maize (Debarros and Larkins, 1994), chick-pea 
(Cervantes et al, 1994), vetch (Becker et al, 1994) and a cysteine protease has been 
described for oil seed rape (Comai and Harada, 1989). However, the published data for oil 
5 seed rape is contradictory. Furthermore, this species is difficult to study due to its amphi- 
diploid nature. Rather than using more conventional and laborious techniques like subtractive 
or differential screening of cDNA libraries or differential display techniques, potentially 
generating clones of unknown identity, cysteine proteinases (cysteine proteases) in oil seed 
rape were studied which are expressed in germinating seeds. Promoters from genes which are 
10 uniquely expressed following seed germination were isolated and characterised. 

Thus, according to a first aspect of the present invention, there is provided an oil seed 
rape cysteine protease gene promoter of class 1, 2 or 6. 

According to a second aspect of the present invention, there is provided a promoter 
comprising at least part of a sequence as shown in Figures 19, 20 or 21, or at least part of a 
15 sequence that has substantial homology therewith, or a variant thereof. 

According to a third aspect of the present invention, there is provided a promoter 
having the characteristic motifs or features of promoters of the present invention. 

According to a fourth aspect of the present invenuon, there is provided a recombinant 
DNA construct comprising the promoter as defined above operably linked to a gene which 
20 codes for a protein of interest. 

According to a fifth aspect of the present invenuon, there is provided a recombinant 
DNA construct functional in a plant comprising a disrupter gene encoding a product capable 
of disrupting cell function, and a promoter as defined above, the disrupter gene being 
functionally linked to and controlled by an externally regulatable gene control region which 
25 includes a promoter which is inducible by the external application of a chemical inducer. 

According to a sixth aspect of the present invention, there is provided DNA 
comprising at least part of the sequence shown in Figures 12, 13, 14, 15, 16 or 17, or at least 
part of a sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 
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According to a seventh aspect of the present invention, there is provided a 
recombinant DNA construct functional in a plant comprising the DNA as defined above 
operably linked to a promoter. 

According to an eighth aspect of the present invention, there is provided an expression 
system for the tissue or tissues of a plant material, the expression system comprising a gene of 
interest fused to a gene promoter as defined above wherein the expression system is capable 
of being expressed in the tissue or tissues of the plant material. 

According to a ninth aspect of the present invention, there is provided an expression 
system comprising a construct as defined above. 

According to a tenth aspect of the present invention, there is provided a recombinant 
plant genome comprising a promoter as defined above, DNA as defined above, a recombinant 
DNA construct as defined above or an expression system as defined above. 

According to an eleventh aspect of the present invention, there is provided a plant, 
plant seed or plant cell having a recombinant plant genome as defined above. 

According to a twelfth aspect of the present invention, there is provided protected 
germplasm comprising a recombinant DNA construct as defined above. 

According to a thirteenth aspect of the present invention, there is provided a plant or 
seed which is capable of growing to maturity comprising a recombinant DNA construct as 
defined above. 

According to a fourteenth aspect of the present invention, there is provided the use of 
a gene promoter as defined above to induce expression of a gene of interest when fused to the 
gene promoter in the tissue or tissues of a plant material. 

Preferably, the inducible promoter of the recombinant DNA construct is functionally 
linked to and controls a repressor protein and the disrupter gene promoter includes an 
operator sequence which is recognised by the repressor protein, so that in the presence of the 
inducer the repressor protein is produced which interacts with the operator sequence thereby 
disabling the second promoter and inhibiting expression of the disrupter gene. 

Preferably, the disrupter gene is a nucleotide sequence, which is in sense orientation to 
an endogenous plant gene which is essential to plant development or a gene conferring a 
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desired characteristic on the plant, or comprises a partial sense sequence of the endogenous 
plant gene. 

Preferably, the disrupter gene is a nucleotide sequence which is in antisense orientation 
to an endogenous plant gene which is essential to plant development or a gene conferring a 
5 desired characteristic on the plant. 

Preferably, the endogenous plant gene is essential to seed germination or early 
seedling development. 

Preferably, the externally regulatable gene control region is a chemically inducible 
gene promoter sequence from the glutathione S-transferase system (which is the subject of 
10 our International Patent Application No. PCT/ GB96/021 16), the Ale system (which is the 
subject of our International Patent Application Nos. PCT/GB96/01883 and 
PCT/GB96/01846) or the ecdysone system (which is the subject of our International Patent 
Application No. PCT/GB96/01 195). 

Preferably, the repressor protein gene encodes a bacterial repressor such as the lac 
1 5 repressor or a repressor used by 434, P22 or lambda-bacteriophages. 

Preferably, the disrupter gene or disrupter promoter contains a "pseudo-operator". 

Preferably, the disrupter gene is a cytotoxic gene. 

Preferably, the disrupter gene encodes a recombinase or a transposase adapted to 
excise a nucleotide sequence flanked by recombinase recognition sequences. 

20 Preferably, the recombinant DN A construct is capable of being expressed in the tissue 

or tissues of a germinating seedling or a developing grain or a plant when the construct is 
integrated, preferably stably integrated, within the grain's or seedling's or plant's genomic 
DNA. 

Preferably, the expression system is for at least the tissue of a germinating seedling or 
25 developing grain or plant (eg in the root, cotyledons, leaves and stem). 

Preferably, the expression system is integrated, preferably stably integrated, within a 
germinating seedling's genomic DNA or a developing grain's genomic DNA or a plant's 
genomic DNA. 
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Preferably, the gene promoter is used to induce expression of a gene of interest when 
fused to the gene promoter in at least the tissue or tissues of a germinating seedling or a 
developing grain or a plant (eg in the root, cotyledons, leaves and stem). 

According to a preferred embodiment of the present invention, the promoter 
comprises a DNA sequence corresponding to that of the promoter region of the clone 
pKS12p6, as shown in Figure 19. 

According to another preferred embodiment of the present invention, the promoter 
comprises a DNA sequence corresponding to that of the promoter region of the clone 
pKS25p7, as shown in Figure 20. 

According to a further preferred embodiment of the present invention, the promoter 
comprises a DNA sequence corresponding to that of the promoter region of the clone 
pKS66pl, as shown in Figure 21. 

An even more preferred embodiment of the present invention is a seedling, grain or 
plant comprising a construct comprising a disrupter gene fused to a cysteine protease 
promoter, wherein the construct is integrated, preferably stably integrated within the 
seedling's, grain's or plant's genomic DNA, wherein the promoter comprises at least part of a 
sequence shown in Figures 19, 20 or 21 , or at least part of a sequence that has substantial 
homology therewith, or a variant thereof and wherein the disrupter gene is a gene which 
encodes barnase ribonuclease. 

An even more preferred embodiment of the present invention is a seedling, grain or 
plant comprising a construct comprising a disrupter gene fused to a cysteine protease 
promoter, wherein the construct is integrated, preferably stably integrated within the 
seedling's, grain's or plant's genomic DNA, wherein the promoter comprises at least part of a 
sequence shown in Figures 19, 20 or 21, or at least part of a sequence that has substantia] 
homology therewith, or a variant thereof and wherein the disrupter gene is a gene which 
encodes a recombinase adapted to excise a nucleotide sequence flanked by recombinase 
recognition sequences. 

Thus, according to a highly preferred embodiment of the present invention, there is 
provided a recombinant DNA construct for insertion into the genome of a plant to impart 
control of plant development thereto, comprises, in sequence: 
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(a) an inducible gene promoter sequence responsive to the presence or absence of an 
exogenous chemical inducer; 

(b) either a gene encoding a repressor protein under control of the said inducible 
gene promoter sequence or a gene encoding an inhibitor of the product of the 

disrupter gene specified at (e) below; 

(c) an operator sequence responsive to the said repressor protein; 

(d) a gene promoter sequence of the present invention; and, 

(e) a gene encoding a protein disrupter of a plant characteristic essential to the 
growth of the plant, whereby the presence or absence of the exogenous chemical 
inducer enables either growth to maturity or causes growth to slow down or stop at an 
appropriate stage. 

The inducible promoter used in the present invention may promote expression of the 
repressor protein in response to stimulation by an exogenous chemical inducer whereby in the 
absence of the chemical inducer nqrepressor protein is expressed to interact with the operator 
thus permitting expression of the disrupter protein gene and in the presence of the chemical 
inducer repressor protein is expressed thereby preventing expression of the gene encoding the 
inhibitor of plant development permitting unimpeded plant growth. 

The term "plant material" includes a developing caryopsis , a germinating caryopsis or 
grain, or a seedling, a plantlet or a plant, or tissues or cells thereof, such as the cells of a 
developing caryopsis or the tissues of a germinating seedling or developing grain or plant (eg 
in the root, leaves and stem). 

The term "gene of interest" or "GOI" with reference to the present invention means 
any gene of interest. A GOI can be any gene that is either foreign or natural to the plant in 
question, except for the wild type functional gene when in its natural environment. 

Typical examples of a GOI includes genes encoding for proteins and enzymes that 
disrupt cell function. For example, the gene may be a cytotoxic gene. Alternatively the gene 
may encode a recombinase, transposase, or a related enzyme with similar properties, adapted 
to inhibit an endogenous plant gene which is essential to plant development or a gene 
conferring a desired characteristic on the plant. 

A recombinase is an enzyme that recognises a specific excision sequence or set of 
specific excision sequences and effects the removal of, or otherwise alters, DNA between 
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specific excision sequences. Recombinase systems such as the Cre-lox, the FLP, SRI and 
SSV1 -encoded integrase systems may be used in the present invention. 

Other examples of a GOI include defensive or protective genes, such as genes giving 
herbicide, fungal or insect resistance. Such genes may be expressed during germination of 
5 seedlings at which time the seedlings are particularly vulnerable. Preferably, the gene encodes 
a protein which confers resistance to biotic and environmental stresses on a plant. 

Also included are endogenous genes such as genes encoding (3-tubulin and adenine 
nucleotide translocator (ANT). 

The term "disrupter gene" is a gene which, when expressed or repressed specifically at 

10 a suitable stage of plant development, will lead to the failure of a plant to reach maturity and 
to set seed. The origin of the disrupter genes can be from a variety of naturally occurring 
sources eg human cells, bacterial cells, yeast cells, plant cells, fungal cells, or they can be 
totally synthetic genes which may be composed of DN A sequences, some of which may be 
found in nature, some of which are not normally found in nature or a mixture of both. The 

1 5 disrupter genes will preferably be targeted to an essential biochemical function, such as DNA 
and RNA metabolism, protein synthesis, and other metabolic pathways. 

In a preferred embodiment, the disrupter gene is a gene which encodes barnase 
ribonuclease, p-tubulin or adenine nucleotide translocator (ANT). 

The term "variant thereof 1 with reference to the present invention means any 

20 substitution of, variation of, modification of, replacement of, deletion of or the addition of one 
or more nucleic acid(s) from or to the promoter sequence providing the resultant sequence is 
capable of expressing a GOL The term also includes sequences that can substantially 
hybridise to the promoter sequence. The term also includes DNA which hybridises to the 
DNA of the present invention and which codes for at least part of a cysteine protease 

25 promoter. Preferably, such hybridisation occurs at, or between, low and high stringency 
conditions. In general terms, low stringency conditions can be defined as 3 x SSC at about 
ambient temperature to about 65°C, and high stringency conditions as 0. 1 x SSC at about 
65°C SSC is the name of a buffer of 0.1 5M NaCl, 0.015M trisodium citrate. 3 x SSC is three 
time as strong as SSC and so on. 

30 The term "substantial homology" covers homology with respect to at least the 

essential nucleic acids/nucleic acid residues of the promoter sequence providing the 
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homologous sequence acts as a promoter eg a cysteine protease promoter which is capable of 
expressing a GOI. Typically, homology is shown when 60% or more of the nucleotides are 
common with the promoter sequence of the present invention, more typically 65%, preferably 
70%, more preferably 75%, even more preferably 80% or 85% and, especially preferred, are 

5 90%, 95%, 98% or 99% or more homology. 

The term "construct" - which is synonymous with terms such as "cassette", "hybrid" 
and "conjugate" - includes a GOI directly or indirectly attached to the promoter of the present 
invention, such as to form a cassette. An example of an indirect attachment is the provision of 
a suitable spacer group such as an intron sequence, intermediate the promoter and the GOI. 

10 The same is true for the term "fused" in relation to the present invention which includes direct 
or indirect attachment. 

The term "expression system" means that the system defined above can be expressed 
in an appropriate organism, tissue, cell or medium. In this regard, the expression system of 
the present invention may comprise additional components that ensure to increase the 

15 expression of the GOI by use of the gene promoter. 

The DN A of the present invention may be genomic DNA which is in an isolated form 
and is, preferably, operably linked to a sequence with which it is not naturally associated, or 
the DNA may be synthetic DNA or cDNA. 

Young seedlings at germination represent a vulnerable stage of plant development. 

20 Strategies to improve crop production may include expression of genes during this stage to 
enhance the seedling's resistance to biotic and environmental stresses such as resistance to 
cold, salt, heavy metals, fungal attack. Protecting seedlings by expressing proteins which 
provide at least some measure of resistance/tolerance against such stresses, for example anti- 
fungal proteins (Cammue et a/ M 1992; Terras et aU 1993) under the control of a germination 

25 specific promoter will limit the expression to a precise phase of development, when the 

proteins will be most effective. Such development specific expression has further advantages 
such as avoiding expression of these genes in plant material entering the food chain. 

The promoters of the present invention may also be advantageously used in plant 
germplasm containment systems. 
30 Agriculture uses many crop plants for the production of food for human consumption, 

for commercial processes yielding products for human consumption, for animal feedstuff 
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production, for the development of industrial products and other purposes. The process 
involves the planting by the farmer of seed which usually has been purchased from a seed 
producer. The product produced by the crop, be it the whole plant, the seed or fruit of the 
plant, is harvested and is then used for the various food applications mentioned above. 

The supplied hybrid or inbred seed may incorporate novel genetic information 
introduced by transformation of the crop giving novel agronomic features such as tolerance to 
herbicides, insect pests, and fungal diseases, improved yield and/or quality of the harvested 
product, and novel mechanisms for the control of plant fertility. Such improvements which 
are made possible through biotechnological research, improve the quality of the plant 
breeding and improve the agronomic performance of the seed supplied to the farmer. 

A problem addressed by the present invention is the containment of crop plants within 
the area of cultivation. Seeds of cultivated crop plants may be conveyed outside the defined 
growing area by a number of routes (by birds or small mammals or simply by being dropped 
during post-harvest transport of a seed crop) where they assume the status of weeds, or they 
may remain as volunteers in a subsequent crop in later years. It would clearly be appropriate, 
if it were possible, that cultivated crops be confined to the growing area and prevented from 
persisting in the wild. It will be appreciated that the problems of crop non-confinement 
mentioned above become more acute where transgenic crops are involved. 

In the same way, pollen can travel long distances through wind and/or insect 
transportation (dispersion) and remain viable for a long period. Since interspecific crossing 
between crop plants and their non cultivated related species is possible, a second concern is 
the escape of pollen from transgenic crops e.g. herbicide resistant crops, to their related 
weeds species (Mikkelsen et al., 1996). Ways to reduce viability of such hybrids would limit 
the risk of transgene escape to non-crop species thus avoiding the spreading of plants with 
enhanced invasiveness or weediness. 

It will be appreciated that the use of the seedling promoters of the present invention 
restricts expression of the disrupter protein gene to a suitable stage of plant development, and 
also means that it is not necessary to continue to apply an inducer chemical to the plant 
throughout its lifetime in order to maintain its viability. This has both economic and ecological 
benefits. 
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The invention also provides a genetically transformed plant and parts thereof, such as 
cells protoplasts and seeds, having incorporated, preferably stably incorporated, into the 
genome the construct of the present invention. 

Thus, the invention provides a plant which can be reversibly inhibited at an appropriate 
5 developmental stage in which said plant contains, preferably stably incorporated in its 
genome, the recombinant DNA construct defined above. 

Expression of a protein encoded by a gene is controlled by the interaction of certain 
regulatory proteins, known as DNA-binding proteins, with a region located upstream of the 
gene. Within the promoter region, there are located several operator regions which contain a 
10 specific oligonucleotide sequence to which these DNA-binding proteins specifically bind. 
These proteins can lead either to activation or repression of gene expression. Thus, they 
control the regulated expression of genes. 

These DNA-binding proteins, which may in fact be either repressors or activators of 
gene expression, are herein referred to for the sake of simplicity as "repressors". 
1 5 The present invention makes use of the well-characterised interaction between 

bacterial operators with their repressors to control the expression of the disrupter gene 
function. Bacterial repressors, particularly the lac repressor, or repressors used by 434, P22 
and lambda bacteriophages can be used to control the expression in plant cells very 
effectively. 

20 A second operator/repressor system is the subject of our published International 

Patent Application No. WO90/08827 which is incorporated herein by reference. 

A third approach for the down-regulation of the disrupter genes which can be 
considered is the use of either "antisense", "sense" or "partial sense" technology. 

A DNA construct according to the invention may be an "antisense" construct 
25 generating "antisense" RNA or a "partial-sense" construct (encoding at least part of the 
functional gene product) generating "sense" RNA. 

"Antisense RNA" is an RNA sequence which is complementary to a sequence of bases 
in the corresponding mRNA: complementary in the sense that each base (or the majority of 
bases) in the antisense sequence (read in the 3' to 5' sense) is capable of pairing with the 
30 corresponding base (G with C. A with U) in the mRNA sequence read in the 5' to 3" sense. 
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Such antisense RNA may be produced in the cell by transformation with an appropriate DNA 
construct arranged to generate a transcript with at least part of its sequence complementary to 
at least part of the coding strand of the relevant gene (or of a DNA sequence showing 
substantial homology therewith). 

"Sense RNA" is an RNA sequence which is substantially homologous to at least part 
of the corresponding mRNA sequence. Such sense RNA, or partial sense RNA, may be 
produced in the cell by transformation with an appropriate DNA construct arranged in the 
normal orientation so as to generate a transcript with a sequence identical to at least part of 
the coding strand of the relevant gene (or of a DNA sequence showing substantial homology 
therewith). Suitable sense, or partial sense constructs may be used to inhibit gene expression 
(as described in International Patent Publication W09 1/08299). 

Antisense RNA constructs which may be used to down-regulate disrupter genes 
include those encoding adenine nucleotide translocator. 

Other approaches which are, or become, available may also be used. 

Further details on such crop containment systems can be found in our published 
International Patent Application No. WO94/03619 which is incorporated herein by reference. 

The promoter of the present invention, then, when linked to an exogenous or foreign 
gene and introduced into a plant by transformation, provides a means for the regulation of 
expression of that foreign gene. The method employed for transformation of the plant cells is 
not especially germane to this invention and any method suitable for the target plant may be 
employed. Transgenic plants are obtained by regeneration from the transformed cells. 
Numerous transformation procedures are known from the literature such as agroinfection 
using Agrobacterium tumefaciens or its Ti plasmid, electroporation, microinjection or plants 
cells and protoplasts, microprojectile transformation, to mention but a few. Reference may be 
made to the literature for full details of the known methods. 

Neither is the plant species into which the promoter sequence is inserted particularly 
germane to the invention. Dicotyledonous and monocotyledonous plants can be transformed. 
This invention may be applied to any plant for which transformation techniques are, or 
become, available. The present invention can therefore be used to control gene expression in 
a variety of genetically modified plants, including field crops such as canola, sunflower, 
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tobacco, sugarbeet, and cotton; cereals such as wheat, barley, rice, maize, and sorghum; fruit 
such as tomatoes, mangoes, peaches, apples, pears, strawberries, bananas and melons; and 
vegetables such as carrot, lettuce, cabbage and onion. The promoter is also suitable for use 
in a variety of tissues, including roots, leaves, stems and reproductive tissues. 
5 Thus, nucleic acid sequences which code for novel cysteine proteases have been 

isolated and characterised. The DNA comprising at least part of the sequence shown in any 
one of Figures 12 to 17 codes for a cysteine protease and corresponds to the coding region of 
the sequence. The present invention also includes DNA which shows homology to the 
sequences of the present invention. The present invention also includes DNA which hybridises 
10 to the DNA of the present invention and which codes for at least part of a cysteine protease. 
Such homology and hybridisation is discussed above in relation to the promoter sequences. 

The present invention further includes DNA which is degenerate as a result of the 
genetic code to the DNA of the present invention and which codes for a cysteine protease. 
Also provided by the present invention is a cysteine protease, which is substantially 
1 5 free from other proteins with which it is ordinarily associated, and which is coded for by 
cysteine protease gene DNA of the present invention. 

The present invention will now be described by way of non-limiting example only, and 
with references to the accompanying drawings, in which: 

Figure 1 shows a schematic outline of the identification of cysteine protease isoforms 
20 using a reverse transcribed PCR library; 

Figure 2 shows the analysis of RT-PCR products by agrose gel electrophoresis and in 
which DG1 = DEGCYS1 and DG2 = DEGCYS2 oligos; 

Figure 3A shows the alignment of the coding regions of the preliminary nucleic acid 
sequences of RT-PCR clones OSR8.401, OSR8.406, OSR8.403, OSR8.404, OSR8.402, 
25 OSR8,389 and OSR8.387; 

Figure 3B shows the alignment of the non-coding regions of the preliminary nucleic 
acid sequences of RT-PCR clones OSR8.389, OSR8.387, OSR8.402 and OSR8.404; 

Figure 4 shows the preliminary nucleic acid sequence of clones OSR8.401, OSR8.402 
and OSR8.389; 
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Figure 5 shows the results of a northern blot of the class 2 clone, OSR8.402, 
comparing expression in the seed and during germination, using a random primed whole RT- 
PCR fragment as probe. In this Figure, L = leaf, C = cotyledons, S = seeds and B = buds; 

Figure 6 shows the results of northern blots of the class 1 clone, CDCYS12 (using the 
5 coding region of the clone as a probe, labelled by PCR), the class 2 clone, CDCYS25 (using 
the non-coding region of the clone as a probe, labelled by PCR) and the class 6 clone, 
CDCYS66 (using the non-coding region of the clone as a probe, labelled by PCR) comparing 
expression in a range of developmental stages. In this Figure, B = bud, W = whole plant and L 
= leaf; 

10 Figure 7 shows the alignment of the deduced amino acid sequences of clones 

OSR8.403, OSR8.404, OSR.402, OSR.389, OSR8.387, OSR8.406 and OSR8.401 with the 
published sequence of COT44 (designated CYS4.BRANA); 

Figure 8 shows a schematic outline of the identification of cysteine protease isoforms 
using a cDNA library; 

15 Figure 9 shows the alignment of the deduced amino acid sequences of the partial 

length cDNA class 2 clones, CYS2UP6.1, CYS2UP7.1 and CYS2UP8.2, with each other 
and with COT44 (designated CYS4.BRANA); 

Figure 10 shows the alignment of the nucleic acid sequences of the partial cDNA 
clones, CYS2UP6, CYS2UP7, CYS2UP8, CYS6UP3, CYS6UP5, CYS6UP2 and CYS6UP4, 
20 with each other and with COT44; 

Figure 1 1 shows the alignment of the full length cDNA clones, CDCYS66, 
CDCYS24, CDCYS22, CDCYS25, CDCYS12 and CDCYS14, with each other and with 
COT44; 

Figure 12 shows the nucleic acid sequence of cDNA clone CDCYS12; 
25 Figure 13 shows the nucleic acid sequence of cDNA clone CDCYS 14; 

Figure 14 shows the nucleic acid sequence of cDNA clone CDCYS22; 
Figure 15 shows the nucleic acid sequence of cDNA clone CDCYS24; 
Figure 16 shows the nucleic acid sequence of cDNA clone CDCYS25; 
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Figure 17 shows the nucleic acid sequence of cDNA clone CDCYS66; 

Figure 18 shows the alignment of the predicted amino acid sequences of cDNA clones 
CDCYS12, CDCYS 14, CDCYS22, CDCYS24, CDCYS25 and CDCYS66, and compares 
the sequences with the characterising features of plant cysteine proteases; 

5 Figure 19 shows a promoter nucleic acid sequence from class 1 genomic clone 

P KS12p6. 

Figure 20 shows a promoter nucleic acid sequence from class 2 genomic clone 
pKS25p7. 

Figure 21 shows a promoter nucleic acid sequence from class 6 genomic clone 
10 PKS66P1. 

Figure 22 shows the mapping of the transcription start by primer extension 
experiments as well as the position of the putative TATA box of clones pKS12P6 (class 1), 
pKS25P7 (class2) and pKS66Pl (class 6). Numbers indicate the distance to the 5'end of the 
cDNAs. 

15 Figure 23 shows the vector constructs for tobacco transformation. 

Figure 24 shows the levels of GUS expression in shoot-generating calli during 
transformation. 

Figure 25 shows the levels of GUS expression in leaves from primary transformants. 

Figure 26 shows the time course of GUS expression in the progeny of 2 random 
20 primary transformants per class, 0 to 36 days after imbibition (DAI). NVS designates the 
wild type negative control. 

Figure 27 shows the GUS activity in the progeny of 24 primary transformants of class 

1. Seedlings were assessed at 0, 14 and 28 DAI. The Figure also includes the GUS activity 
in young leaves from the primary transformants. 

25 Figure 28 shows the GUS activity in the progeny of 24 primary transformants of class 

2. Seedlings were assessed at 0, 14 and 28 DAI. The Figure also includes the GUS activity 
in young leaves from the primary transformants. 



WO 97/35983 



• 15- 



PCT/GB97/00729 



Figure 29 shows the GUS activity in the progeny of 24 primary transformants of class 
6. Seedlings were assessed at 0, 14 and 28 DAI. The Figure also includes the GUS activity 
in young leaves from the primary transformants. 

In general, in the Figures the designations CO or COD in connection with a clone 
indicate the coding region, and NC or NCD indicates the non-coding region. 

In outline, the Examples describe the amplification of a range of germination- 
expressed cysteine protease partial clones by reverse transcriptase polymerase chain reaction 
(RT-PCR) on germinating oilseed rape RNA. Preliminary assessment of the clones' 
expression in dry seeds and germinating seedlings by northern blotting, followed by more 
detailed northern blot experiments on selected clones to assess their time course of 
expression. A cDNA library was constructed from tissues showing a high expression of these 
clones, followed by screening of a genomic library to subclone the promoter areas. Final 
assessment of the spatial and temporal regulation of the cloned promoters was conducted by 
transcriptional fusion of the promoter fragments with the P-glucuronidase (GUS) reporter 
gene and transformation into tobacco. 

Example 1 - CONSTRUCTION OF A CYSTEINE PROTEASE REVERSE 
TRANSCRIBED PCR LIBRARY FROM GERMINATING OILSEED RAPE 
SEEDLINGS 

In order to identify germination specific sequences, a RT-PCR approach on oilseed 
rape seedling RNA was utilised. A reverse oligo(dT) primer and forward primers, designed to 
the cysteine protease coding regions conserved between several plant species, were used to 
amplify a 750 bp RT-PCR product covering 500 bp of the coding region and about 250 bp of 
the non-coding region. The 5' -end was sequenced to confirm the identity of the RT-PCR 
products as cysteine protease related clones. Since there is much less pressure of selection on 
non-coding regions, significant differences in the 3'-end non-coding regions may predict 
differences in the 3*-end untranslated regions with an effect on the promoter. This general 
approach is shown schematically in Figure 1. 
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Preparatory Methods : 
Plant material 

Five grams of oil-seed rape seeds (Brassica napus) from the variety Westar were 
sterilised in 1% sodium hypochlorite for 10 minutes. After several washes in sterile water, 
seeds were imbibed with sterile water for 12 hours at 4°C in the dark to synchronise the 
germination. They were sown on wet sterile Whattman paper and grown at 25°C in the dark 
prior to harvesting the cotyledons. 

RNA extraction 

Lithium chloride method 

Total RNA was isolated from dry seeds and three days-old oil seed rape seedlings 
using a modification of the protocol described by Jepson et al, (1991). Tissues (5 g) were 
ground with liquid nitrogen in a mortar and pestle until a fine powder was obtained. After 
addition of 9 ml homogenisation buffer [400 mM NaCl; 50 mM Tris-HCl, pH 9.0; 1 % SDS; 
5 mM ethylene diamine tetraacetic acid (EDTA); 4 U/ml heparin; 1 mM aurintricarboxylic 
acid (ATT); 10 mM dithiothreitol (DTT)] and 4.0 ml phenol saturated in homogenisation 
buffer and supplemented with 10 % (v/v) m-cresol before use, the tissues were ground again 
until a fine paste was obtained. The paste was transferred to a cold corex tube and 
centrifuged for 15 minutes at 13,000 rpm (Sorval, SS34, 4°C). The supernatant was 
transferred to another tube, extracted for 5 minutes with 5 ml of phenol-chloroform and 
centrifuged for 30 minutes at 9,000 rpm (Sorval, SS34, 4°C) to recover the aqueous phase. 
After 3 phenol-chloroform extractions, the RNA was recovered by precipitating the 
supernatant overnight on ice with one fifth volume of 12 M lithium chloride. After 
centrifuging for 30 minutes at 9,000 rpm (Sorval, SS34, 4°C), the supernatant was removed 
and the pellet resuspended in 1 ml of 5 mM Tris (pH 7.5) prior to transfer to a microtube. 
After a second lithium chloride precipitation overnight, the pellet was washed twice with 70% 
ethanol, resuspended in 0.2 ml DEPC treated water and stored at -70°C. 
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Caesium chloride method 

Total RNA was isolated from a range of developmental stages of oil seed rape 
seedlings using a protocol modified from Okayama ex al (1979). Tissues (2^4 g) were ground 
in a mortar and pestle with 1 g AI2O3 in the presence of liquid nitrogen. The powder, kept in 

5 dry ice, was then mixed in a corex tube with 8 ml of pre-warmed (65°C) homogenisation 
buffer [5M thiocyanate guanidine; 0-5 %, w/v, lauryl sarcosine sodium; 0.025 M sodium 
citrate; pH 7.0] supplemented before use with 2.5 % (v/v) B-mercaptoethanol, and incubated 
at 40°C for 10 minutes with vortexing until complete defrosting of the tissues. After 
centrifugation at 12,000 rpm for 30 minutes at 15°C (Sorval, SS34 rotor), the supernatant 

10 was recovered, homogenisation buffer added to 8 ml and supplemented with 0. 1 g CsCl per 
ml. After the CsCl had dissolved, the homogenate was added in a 12 ml polyalomer tube 
containing 2.5 ml of a CsCl high density cushion [5.7 M CsCl; 0.1 M EDTA] without 
disturbing the cushion. After centrifugation at 25,000 rpm for 24 hours at 20°C (Sorval, TH- 
641 rotor) the supernatant was removed by suction and the wall of the tube cleaned with 

15 absorbent paper before resuspension of the RNA loop into 300 \i\ resuspension buffer [7 M 
urea; 2%, w/v, lauryl sarcosin sodium]. The RNA was then transferred to a 1.5 ml tube and 
extracted with an equal volume of phenol and an equal volume of chloroform/isoamylic 
alcohol [24: l t v/v]. Following centrifugation at 13,000 rpm for 5 minutes the aqueous phase 
was recovered and extracted again with an equal volume of chloroform. The RNA was 

20 precipitated overnight at -20°C by adding a one-tenth volume of 3 M sodium acetate and 2.5 
volumes of cold ethanol. After centrifugation at 13,000 rpm for 15 minutes at 4°C, the 
supernatant was discarded and the pellet was washed with 1.5 ml of cold 70% ethanol. The 
centrifugation was repeated for 10 minutes, the supernatant was discarded and the pellet dried 
for 3-4 minutes in a speedvacuum. The pellet was resuspended in sterile DEPC-treated water, 

25 the RNA precipitated again and the pellet washed, as described above, and finally 
resuspended in 50 \il DEPC-treated water and stored at -70°C. 
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Labelling of DNA probes 
Terminal exchange 

The oligos (25-50 ng) were labelled by phosphorylating their hydroxylated 5' -end 
using 20 U T4 polynucleotide kinase (New England Biolabs) in a 25 ul reaction [30 uCi [y- 
5 32 P] ATP (Amersham, 5000 Ci/mmol), IX kinasing buffer] at 37°C for 30 minutes. The 
oligos were purified from the unincorporated nucleotides using G-25 sephadex spun columns 
(5prime-»3prime, Inc®), as recommended by the manufacturer. 

PCR 

Plasmid DNA (5 ng) was amplified [(94°C, 1 minute; 65°C, 1 minute; 72°C, 1 minute) 
10 x 17 cycles; ( 72°C, 7 minutes) x 1 cycle] using 2.5 U of Taq polymerase (Gibco BRL) in a 50 
U.1 PCR reaction [0.25 uM [a- 32 P] dATP( Amersham, 3000 Ci/mmol); 0.4 uM dATP; 50 \iM 
other dNTPs; 1.5 mM MgC12; 0.5 U.M oligos; IX PCR buffer]. The probes were purified 
from the unincorporated nucleotides using G-50 sephadex spun columns (Pharmacia Biotech), 
as recommended by the manufacturer. 

15 Random priming 

Plasmid DNA (25-50 ng) was labelled using the "Oligolabelling Kit" (Pharmacia) 
following manufacturer's recommendations. 

Northern blot hybridization 

20 Northern blot experiments were performed according to Sambrook et al. Total RNA 

( 10 ng) was mixed with 2.5 vol of loading buffer (5prime-»3prime, lnc®), sized by 
electrophoresis on a 1.2 % agarose denaturing gel [IX MOPS(40 mM MOPS, pH 7.0; 10 
mM sodium acetate; 1 mM EDTA); 17%, v/v, formaldehyde] in alkaline running buffer [IX 
MOPS; 7%, v/v, formaldehyde] and transferred to nylon membranes (Hybond N, Amersham) 

25 by capillary blotting according to the manufacturer's recommendations. Hybridizations were 
performed overnight at 65°C in hybridisation buffer [5XSSPE (900 mM NaCl; 50 mM 
NaH 2 PO 4 .H 2 0; 5 mM EDTA; pH 7.4, NaOH); 0.5% sodium dodecyl sulphate (SDS); 1% 
powdered milk] supplemented with 100 p-g/ml denatured salmon sperm DNA. Blots were 
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washed in 3X SSC [20X SSC: 450 mM NaCl; 45 mM Na 3 citrate.2H 2 0; pH 7.0, HC1] and 1 % 
SDS for 1 hour at 65°C, and exposed to X-ray films (X-OMAT AR, Kodak) at -80°C with 
intensifying screen. 

RT-PCR Library Construction and Screening 

5 

A RT-PCR library was constructed from germinating oilseed rape seedlings using a 
"Perkin Elmer GeneAmp® RNA PCR Kit" and following the manufacturer's 
recommendations. Total RNA (1 ug) extracted from cotyledons 3 days after seeds imbibition 
was reverse transcribed [65°C, 2 minutes; 42°C, 30 minutes; 99°C, 5 minutes; 6°C, 5 

1 0 minutes] using an oligo(dT) primer (MPRACE IB) stabilised for the PCR. The heteroduplex 
DNA-RNA was subsequently amplified by PCR using the same downstream primer as for the 
reverse transcription and a set of two upstream degenerate primers (DEGCYS1 and 
DEGCYS2) designed on the peptide level from a motif of cysteine protease coding regions 
conserved among most of the plant species [GCNCCLM(NED)). Cycling conditions: [(95°C, 

1 5 2 minutes; 55°C, 2 minutes; 72°C, 1 .5 minutes) x 2 cycles; (95°C, 2 minutes; 55°C, 1 minute; 
72°C, 1.5 minutes) x 33 cyclesj followed by 7 minutes at 72°C. 

The RT-PCR products were ligated into a pCRII vector before transforming E. coli 
using the "TA Cloning® Kit" (Invitrogen) following the manufacturer's recommendations. 
This system takes advantage of the non-template dependent activity of thermostable 

20 polymerases used in PCR that add a single deoxyadenosine to the 3 '-ends of all duplex 
molecules provided by PCR. This allows direct cloning into a pCRII vector which contains 
overhanging deoxythimidine. Plasmid DNA was purified using "Wizard DNA miniprep DNA 
purification system"(Promega) and sequenced by the chain termination method (Sanger et al, 
1977) with "Sequenase version 2.0 T7 DNA polymerase" (USB) according to the 

25 manufacturer's recommendations. 



OLIGOS 


SEQUENCE ( 5'-» 3* ) 


MPRACE IB 


GGC CAC GCG TCG ACT AGT TAC TCG AGT 
TTT TTT TTT TTTTTT T 


DEGCYS1 


GGI TG(CT) AA(CT) GGI GGI (CT)TI ATG 


DEGCYS2 


GGI TG(CT) AA(CT) GGI GGI (CT)TI ATG (GA)A 
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Analysis 

The RT-PCR products were analysed by electrophoresis on an agarose gel. As can be 
seen in Figure 2, two major 750 bp fragments of the expected size, amplified from cotyledons 

5 RNA, but not from seeds RNA were found. These were gel excised and cloned. In a parallel 
approach, aliquots of the RT-PCR products without gel purification were cloned to generate 
a three days-old expressed cysteine protease library containing 400 clones from which 22 
came from gel excision. A colony screening with the oligos used for the RT-PCR identified 
250 putative cysteine proteases. Seven clones from gel excision were taken at random and 

1 0 fully sequenced, all of them were cysteine proteases, and fell into 3 classes. 



These clones were given the following designations : 



CLONES 


CLASS 


OSR8.401 


1 


OSR8.402 


2 


OSR8.403 


2 


OSR8.404 


2 


OSR8.406 


6 


OSR387 


6 


OSR.389 


6 



The preliminary DNA sequences of these clones and their alignment is shown in 
Figures 3A and 3B. 

Three clones, one per class, OSR8.401 (class 1); OSR8.402 (class 2) and OSR8.389 
(class 6) whose preliminary DNA sequences are shown in Figure 4, were labelled by random 
priming and assessed on northern blots containing total RNA extracted from seeds and 
cotyledons using the lithium chloride method. They appeared to be well expressed during 
germination but not in dry seeds. Furthermore, some cross-hybridisation was observed 



WO 97/35983 



-21 - 



PCT/GB97/00729 



between the different classes due to the nature of the probe. This is illustrated in Figure 5 
which shows the result for the class 2 clone, OSR8.402, with a random primed whole RT- 
PCR fragment as probe. 

The library was then screened using the five cysteine protease clones labelled by 
5 random priming. No new classes were clearly identified, all the diversity of clones was 

present in the gel-excised fragments. The expression pattern of the three classes of clones was 
assessed by northern blots with RNA, extracted with the caesium chloride method, from a 
range of developmental stages as shown in Figure 6. For class 2 and 6, hybridisations were 
performed using the 3'-non coding regions of clones OSR8.402 and OSR8.389 respectively, 

10 to avoid any cross-hybridisation. For class 1, a part of the coding region of OSR8.401 clone 
was used as a probe because no non-coding region was available in the RT-PCR clones. To 
increase the specificity, probes were labelled by PCR using the oligo(dT) reverse primer 
(MPRACE1B) and an internal reverse primer (CYS8.402 and CYS8.389) for classes 2 and 6, 
whilst using 2 internal primers (CYS8.401 and CYS8.401R) for class 1. Classes 2 and 6 are 

1 5 expressed following seed imbibition and for the first 4-5 days of early seedling growth but are 
not expressed in mature plant organs or in the developing seed. Class 1 shows some 
expression in buds and leaves but this may be the result of some cross-hybridisation due to the 
nature of the probe. Class 2 is highly related to COT44, the only cysteine protease published 
for oilseed rape (Comai and Harrada, 1989). Figure 7 shows alignment of deduced amino 

20 acid sequences of the clones with COT44. 



OLIGOS 


SEQUENCE ( 5'-» 3' ) 


CYS8.401 


TAT CCT TAT CAA GAA CGT GAT GGC A 


CYS8.401R 


CCT ACG ATG AGC ACT GCG TGG T 


CYS8.402 


GCA GTA ATC AAA TTG GGA TTG TTA TAA 


CYS8.389 


CGT GGA ACC AGC AGT GTT TGA AGT T 
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Example 2 - CONSTRUCTION OF A STANDARD OLIGOd(T) PRIMED AND A 
CYSTEINE PROTEASE SPECIFICALLY PRIMED cDNA LD3RARY FROM 
GERMINATING OILSEED RAPE SEEDLINGS 

5 As the RT-PCR products were not full length and to avoid PCR generated mutations, 

an oilseed rape cDNA library was constructed from a developmental stage showing a high 
expression of the three clones of interest. A specifically primed library was constructed using 
two specific oligos designed on the basis of the three classes of RT-PCR clones, rather than 
using an oligo(dT) primer. As a result only the cysteine protease clones were reverse 

10 transcribed, which provided a small number of short clones of about 650 bp, all of them being 
full length at the 5'-end. This allowed the design of oligos to the 5'-non-coding regions for 
use in screening a standard oligo(dT) primed cDNA library directly for full length clones. 
This general approach is shown in schematically in Figure 8. 

Preparatory Methods : 

IS Plant material 

Five grams of oilseed rape seeds {Brassica napus) from the variety Westar were 
sterilised in 1% sodium hypochlorite for 10 minutes. After several washes in sterile water, 
seeds were imbibed with sterile water for 12 hours at 4°C in the dark to synchronise the 
germination. They were sown on wet sterile Whattman paper and grown at 25°C in the dark 
20 for 2 days prior to harvesting the cotyledons. 

RNA extraction and purification 

Total RNA was isolated from 2 days-old oil seed rape seedlings using the caesium 
chloride method described previously. Polyadenylated RNA was purified from 1 mg of total 
RNA using a "PolyATract mRNA isolation system" (Promega), according to the 
25 manufacturer's recommendations. The system uses a biotinylated oligo(dT) primer to 

hybridise at high efficiency in solution to the 3'-poly(A) region of the mRNAs. The hybrids 
were then captured and washed at high stringency using streptavidin coupled to paramagnetic 
particles and a magnetic separation strand, prior to elution with water. 
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cDNA Library Construction and Screening 

The standard oligo(dT) primed cDNA library and the cysteine protease specifically 
primed library were constructed from 2 days-old oilseed rape poly(A) RNA (5 fig), using a 
lambda "ZAP-cDNA® Synthesis Kit" (Stratagene). The manufacturer's recommendation 
were followed strictly although, for the specific library, the reverse transcription was primed 
using a mix of two specific oligos (CDNA8.401R and CDNA8387R) respectively for the 
class 1 and for classes 2 and 6 of the RT-PCR clones, modified to include a Xho I site at their 
5' -end. The second strand was synthesised by nick-translation using DNA polymerase I, after 
treatment of the heteroduplex with RNase H. The cDNAs were filled in with Klenow, ligated 
to EcoR I adapters and digested with Xho I prior to size-fractionation on Sepharose®-400 
spun column (Pharmacia) and directional cloning as an EcoR l-Xho I insert into the polylinker 
of pBluescript phagemid contained within the Uni-ZAP vector arms. Lambda-ZAP is a 
replacement lambda which has been engineered to contain pBluescript phagemid, which 
polylinker is used for cloning the cDNAs. 



OLIGOS 


SEQUENCE (5'-> 3') 


CDNA8.401R 


GAG AGA G_AG_ AGA GAG AGA GAA CTA GTC 
TCG AGT CCC ATG GTT TTT AAT 


CDNA8.387R 


GAG AGA GAG AGA GAG AGA GAA CTA GTC 


TCG AGC CGC CGT TTT TCA T 



The library was packaged in vitro using Gigapack® II Gold packaging extract 
(Stratagene), plated on E. coli cell line XLl-Blue MRF and transferred onto nylon 
membranes (Hybond N, Amersham) according to the manufacturer's recommendations. 
20 Labelling of probes, hybridisations and washes were performed as described previously. 

Selected lambda-ZAP clones were excised in vivo to recover the cloned cDNA as a 
phagemid in pBluescript SK\ £ coli SolR cells were co-transfected with the recombinant 
lambda-ZAP and with a helper phage which provided the proteins necessary for the synthesis 
of a single strand of DNA which, once circularised, provide a functional phagemid. 



25 
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Analysis 

a) A small cysteine protease-enriched specifically primed cDNA library containing 1 . 10 4 
plaque forming units (pfu) was obtained from 2 days-old oilseed rape cotyledons. 5. 10 3 pfu 
were plated and transferred onto three replicate membranes to check for cross-hybridisation. 
5 Membranes were screened with 3 oligos (CYS8.40 1MR, CYS8.402MR, CYS8.406MR) 
designed respectively to the 5'-end of classes 1, 2 and 6 of the RT-PCR clones and labelled as 
described above. 

Five similar but distinct 5' -end cysteine protease cDNAs containing a short 5'- 
untranslated region and 650 bp of coding region were obtained, excised in vivo and 

10 sequenced. They fall into class 2 and class 6 but no 5'-end cDNA was found for class 1 . 

Although clones from class 2 are highly related to COT44 cDNA (Comai and Harrada, 1 989), 
their 5'-end is 160 bp longer. This indicates that COT44 is missing, 46 amino acids (aa) 
corresponding to the signal peptide and to a part of the propeptide. This is further illustrated 
by Figure 9 which shows the alignment of the deduced amino acid sequence of the cDNA 

1 5 clones CYS2UP6, CYS2UP7 and CYS2UP8 from class 2 with COT44. 

Two oligos (CYS6B-UP and CYS6A-UP) were designed to the 5'-non-coding region 
of respectively, cDNAs from classes 2 and 6, to screen the standard oligo(dT) primed library 
directly for full length clones. The preliminary sequence alignment of the clones with each 
other and COT 44 is shown in Figure 10. 



OLIGOS 


SEQUENCE (5'-> 3') 


CYS8.401MR 


CCC ATG GTT TTT AAT GAC AAA TTG AAA A 


CYS8.402MR 


CCG CCG TTT TTC ATT ATG AAT TGA AA 


CYS8.406MR 


CGC CGT TTT TCA TGA TGA ATT GAA AA 



b) A representative oligo(dT) primed cDNA library containing 1 . 10 7 pfu was obtained 
from 2 days-old oilseed rape cotyledons. The size of the inserts, estimated by PGR, is ranging 
from 0.75 kb to 3 kb with an average insert size of 1 .5 kb. 1 . 10 6 pfu were plated and 
25 transferred onto three replicate membranes to check for cross-hybridisation. Membranes 
were first screened with a 5*-end oligo (CYS8.401MR, CYS6B-UP and CYS6A-UP) 
designed respectively to the 5 '-end coding region of class 1 RT-PCR clones and to the 5'- 
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untranslated region of class 2 and class 6 cDNAs to identify full length clones for class 2 and 
6 (class 1, 20 positives; class 2 , 250 positives; class 6, 130 positives). 



OLIGOS 


SEQUENCE (5'-> 3') 


CYS8.401MR 


CCC ATG GTT TTT AAT GAC AAA TTG AAA A 


CYS6B-UP 


TAG AAA ACC AAC AAA ACA AAC ATA CAA T 


CYS6A-UP 


GAA CAA CCA AGC CAA ACA TAC AAT AT 



5 The same membranes were screened again with the 3' -end probes used on the 

developmental northern blots to ensure the correct form was chosen. For classes 2 and 6, 
hybridisations were performed using the 3' -non-coding regions labelled by PCR to avoid any 
cross hybridisation. For class 1, a part of the coding region was used as a probe because non- 
coding region was not available in the RT-PCR clone (class 1, 84 positives; class 2, 205 
10 strong positives; class 6, 85 strong positives). 

Comparison between the number of clones identified with both specific probes (5' and 
3*) confirmed that most of the cysteine protease cDNA clones present in the library are full 
length. 

Ten clones per class which hybridised with (5' and 3' probes) but did not cross 
15 hybridise with probes from the two other classes were plaque purified and four clones per 
class were excised in vivo. Six full length cysteine protease cDNA clones falling into the 
three classes of cysteine proteases identified from the RT-PCR work were isolated and fully 
sequenced (two for class 1, three for class 2, and one for class 6). The alignment of the 
cDNA clones is shown in Figure 11. They fall into 3 classes of CP related to the papain super 
20 family and the pre-proenzymes share 52% (class 1), 90% (class 6) and 96% identity (class 2) 
with cot44. Figures 12 to 17 show the nucleic acid sequences of clones CDCYS 12, 
CDCYS 14, CDCYS22, CDCYS24, CDCYS25 and CDCYS66 respectively. The peptide 
sequences were predicted and showed the characteristic features present in most of the plant 
cysteine proteases, as shown in Figure 18 for clones CDCYS 12, CDCYS 14, CDCYS22, 
25 CDCYS24, CDCYS 25 and CDCYS66. 



WO 97/35983 



-26- 



PCT/GB97/00729 



Example 3 •• SCREENING OF AN OILSEED RAPE GENOMIC LIBRARY AND 
SUBCLONING AND CHARACTERISATION OF THE PROMOTER REGIONS 

Oligonucleotide probes were generated to the 5' -end non-coding region of one cDNA 
clone per class and used to screen a genomic library in order to isolate clones carrying the 
promoter regions. For each class, genomic clones were isolated and the promoter subcloned 
into a phagemid for more precise characterisation and deletion. 

Genomic Library Construction and Screening 

An amplified XEMBL-3 random genomic library (Clontech) from oilseed rape 
(Brassica napus cv. Bridger) was constructed. DNA was partially digested with Mbo I and 
the fragments were separated on a sucrose gradient to produce size range between 8 to 22 kb 
before cloning into the BamH I site of a XEMBL-3 replacement vector. The library was 
plated on E. coli strain LE392 cells. Screening and plaque purification were performed as 
described by Sambrook et al. (1989). Genomic clones corresponding to the three classes of 
cDNAs were isolated and JlDNA minipreparations were carried out using a protocol from 
Grossberger (1987). Genomic clones were mapped using their restriction fragment length 
polymorphism (RFLP) patterns: clones were digested with a set of restriction enzymes, 
analysed on a 0.8% agarose gel and simultaneously transferred onto two membranes (Hybond 
N, Amersham) according to Sambrook et al, (1989) prior to hybridisation. 

Analysis 

For the primary screening twenty genome-equivalent (2.10 6 pfu) were plated and 
transferred onto three replicate membranes to check for cross-hybridisation. Membranes 
were hybridised with 3 oligos (CDNAI2, CDNA25 and CDNA66) designed respectively to 
the 5*-end of classes 1, 2 and 6 of the cDNA clones, to get as close as possible to the 
promoter area (class 1, sixteen strong positives; class 2, nine strong positives; class 6, eight 
strong positives). No cross-hybridisation was detected between the three classes and ten 
clones per class were chosen for a secondary screening. 
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OLIGOS 


SEOUENCE ( 5'-» 3' ) 


CDNA12 


ATC GTC TTC TTC CTT TGT TTC TCT CA 


CDNA25 


CTT CGT CAG CGA AAC TCC TCT CTT 


CDNA66 


CAG AAC TAG AAC AAC CAA GCC AAA C 



The secondary screening was performed using 3' -end PCR probes as described in 
connection with Figure 6, to specifically detect the genomic clones corresponding to the RT- 
PCR clones assessed by northern blots. Some of the clones were not identified by these 
5 probes (one for class 1 and two for class 6), since these were likely to be short clones in their 
3'-end, thus useful for promoter isolation, they were rescued using the probes from the 
primary screening. Ten clones per class were plaque purified by two more rounds of 
purification using the probes from the primary screening. In order to avoid redundant clones 
(amplified, random library), DNA was prepared from eight genomic clones per class and 

10 characterised by RFLP. Clones were cut with Sal I and BamU I, analysed on a 0.8% agarose 
gel, transferred onto replicate membranes and hybridised with two sets of probes. Oligos 
were designed to the 5'-non-coding region (CDNA12, CYS6B-UP and CYS6A-UP) and to 
the middle of the coding region (CYS8.401MR, CYS8.406MR and CYS8.406MR) of cDNA 
clones CDCYSI2, CDCYS25 and CDCYS66 respectively. 

15 Four remaining clones per class (12g4, 12g5, 12g6, 12g8; 25g2, 25g4, 25g5, 25g7; 

66gl, 66g4, 66g8 and 66g9) were further characterised using another round of 
digestion/hybridisation. Class 1 cDNAs contain a Bgl U site 500 bp from the translation start 
and class 2 and 3 cDNAs contain a Hind III site 300 bp from the translation start, these 
enzymes were used in association with and without 5a/ 1, which releases the insert, to 

20 generate genomic fragments suitable for subcloning. PCR experiments were carried out on 
genomic DNA and cDNAs to predict the size of the promoter area by identifying putative 
introns. PCR, using a forward primer in the 5' -non-coding region and a reverse primer located 
after the Bgl II and Hind III restriction sites, showed the presence of a 400 bp intron within 
the first 600 bp of class 1 cDNAs whilst no intron was present within the first 300 bp of class 

25 2 and class 6. Promoter fragments with a predicted size in the range of 2-5 kb were identified 
for one genomic clone per class (12g6, 25g7 and 66gl), ready to be subcloned into 
pBluescript KS*. 
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Example 4 - CHARACTERISATION OF TRANSCRIPTION STARTS 

Genomic lambda-fragments containing the promoter were subcloned into pBluescript 
5 KS + for more precise characterisation. Sequencing allowed the identification of putative 
transcription signals before mapping the actual transcription start by primer extension 
experiments. This involved the extension of a labelled reverse primer designed in an area 
close to the translation start. After degradation of the RNA template the extension products 
were sized in a polyacrylamide gel. 

10 

Analysis 

Genomic fragments containing the promoter were subcloned into pBluescript as a Bgl 
II-2.6 kb insert cloned in BamU I for class 1 (pKS 12P6), as a Hind ID-4.2 kb insert for class 
2 (pKS25P7) and as a BamH l-Hind EI-2.4 kb insert for class 6 (pKS66Pl). Sequencing 
15 with pUCl and pUC4 vector oligos and with two internal reverse primers designed to the 5'- 
endofthecDNAs (CDNA14R for class 1 and CDNA66R for class 2 and 6), allowed the 
orientation of the clones and the identification of putative transcription signals (Pautot et 
a/., 1989). The full nucleotide sequence of the promoters from the sub-cloned genomic 
fragments is given in Figures 19, 20 and 21 respectively for class 1, 2 and 6. 



20 



OLIGOS 


SEQUENCE ( 5'-> 3') 


CDNA14R 


GAA GAA ACT AGA AGA AGG GAG AAG AA 


CDNA66R 


TCA CTT CTT CAT CGG TTC TCC A 



Transcription starts have been mapped precisely by primer extension experiments 
according to Sambrook et al. (1989) modified as follows. Oligos (CYSGE12R, CYSGE25R 
and CYSGE66R) designed respectively for class 1, 2 and 6 were labelled by terminal 
25 exchange as described previously. Total RNA (50 ug), isolated from 3 days-old oilseed rape 
cotyledons using the caesium chloride method, was precipitated together with 2 ng of primers 
and resuspended in 30 pi hybridisation buffer [1 mM EDTA; 400 mM NaCl; 40 mM Pipes, 
pH 6.4; 70%, v/v. deionised formamide]. Annealing was performed overnight at 32°C 
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following denaturing at 85°C for 10 minutes. After precipitation and resuspension in 25 (il of 
reverse transcription buffer [50 mM KC1; 10 mM MgCl 2 ; 1 mM dNTPs; 1 U/ul RNase 
inhibitor], the primers were extended for 90 minutes at 42°C with 2.5 U MuLV reverse 
transcriptase (Perkin-Elmer). Template RNA was degraded for 30 minutes at 37°C with 20 U 
5 of RNase (RNace-it, Stratagene). For each class, the extension products were analysed on a 
polyacrylamide denaturing gel in parallel with a sequencing reaction performed on the 
genomic clones (pKS12P6, pKS25P7 and pKS66Pl) using the same primers as for the 
primer extension. 



OLIGOS 


SEQUENCE (5'-> 3') 


CYSGE12R 


AGG AAG AAG ACG ATG ATG GTG ACA 


CYSGE25R 


GTA CAA GAG AAG TAA AGA GAG GAG T 


CYSGE66R 


CGT ATA GGA GAA GTA AAG AAA TGA GT 



10 

As shown in Figure 22, class 1 transcription start is in a good context 
[t27T3 5 C49A 7 8a,8C45g8%] compared to 49 other plant genes compiled by Pautot et al (1989). 
Conserved nucleotides are in capitals, important ones are in bold and the transcription start 
point is underlined. This transcription start has been mapped 22 nucleotides downstream a 
15 putative TATA box localised 179 nucleotides before the translation start and 152 nucleotides 
upstream the longest cDNA. 

Although the consensus for the TATA box is not optimal 
Ic34ai8t32t34a3A97T 9 oA94a47A95a3oA7iG44%] compared to 79 other plant genes compiled by 
Pautot et al (1989), this result is a confirmation of a previous primer extension experiment 
20 using an oligo priming 34 nucleotides downstream the ATG. The distance between the 
transcription start and the longest cDNA might be explained by the presence of an intron 
within the 5 'untranslated region or by the existence of an alternative transcription start point. 

The Class 2 transcription start is in a good context [a,8a 2 oa22A78T49C45A43%L and has 
been localised 33 nucleotides after a putative TATA box fitting very well within the plant 
25 consensus [T37g n gi4t34T96A97T9oA94a47A95T 6 3A7iG44%]. This corresponds to 53 nucleotides 
before the translation start and 26 nucleotides upstream the cDNA (Figure 22). 

Class 6 transcription start is nearly in the same context as class 2 
[gi8a2oa22A 78 T< 9 C 4 5A43%] and has been localised 30 nucleotides after a putative TATA box 
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showing exactly the same consensus as for class 2. This corresponds to 5 1 nucleotides before 
the translation start and 19 nucleotides upstream the cDNA (Figure 22). 

Example 5 - PROMOTER EXCISION 

Prior to fusion with the reporter genes, promoters must be cut precisely between the 
transcription start and the translation start. Since no useful restriction site was available for 
the class 2 and 6 genomic clones, a site was engineered into a PCR fragment used to replace a 
corresponding endogenous fragment. 

Analysis 

A Hind m site was introduced by PCR on class 1 genomic clone, 2 nucleotides before 
the translation start, in order to eliminate the remaining part of coding region. The 270 bp 
fragment was generated by 15 cycles of PCR on pKS12p6 DNA, using CYSGE12C and 
CYSG12CRoligos. 

In the same way, a BamH I site was introduced before the translation start of class 2 
and class 6 genomic clones using 2 sets of oligos (CYSGE25C, CYSG25CR and 
CYSGE66C, CYSG66CR) respectively on pKS25p7 and pKS66pl DNA to generate a 165 
bp fragment. 

For class 1, pKS12p6 was cut with Bsm I and Hind m and gel recovered to excise a 
900 bp fragment prior to replacement with the PCR fragment cut by Bsm I and Hind III, to 
generate pKS12P. 

For class 2, pKS25p7 was cut with Sph I and BamH I, gel recovered to excise a 460 
bp fragment and ligated to the replacement PCR fragment cut by Sph I and BamH I, to 
generate pKS25P. 

For class 6, pKS66pl was cut with Hind m, filled in with Klenow fragment of DNA 
polymerase I, cut with Sph I and gel recovered to excise a 440 bpfragment. The PCR 
fragment was blunt ended with T4 DNA polymerase, cut with Sph I and cloned into the 
deleted pKS66pl to generate pKS66P. 
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OLIGOS 




CYSGE12C 


GTA ATG GCC TAG CCT GTC TGG C 


CYSG12CR 


GAT GAT GGT GAC AAG CTT TTT CTT ACA GG 


CYSGE25C 


CTA TCT TGC ATG CCC ATT ATT ACT TT 


CYSG25CR 


ACG AAG CCG GAT CCT ATG TTT GTT TTG TTG 


CYSGE66C 


CAT CTT GCA TGC CCA TTA CTG CAT 


CYSG66CR 


AGG AAG CCG GAT CCT ATG TTT GGC TTG G 



Example 6 - PROMOTER-GUS CONSTRUCTS 



In order to assess the spatial and temporal regulation of the cloned promoter regions in an 
heterologous system and under different biotic and environmental conditions, they were used 
them to drive a reporter gene into tobacco. Transcriptional fusions between each promoter 
fragment and the (3-glucuronidase (GUS) gene were engineered for plant assays and 
histochemical localisation. For the avoidance of doubt, a reporter gene is used here for 
convenience only, and to demonstrate the principles involved. In non-test situations the gene 
controlled by the promoter of the present invention will be that which produces the desired 
effect. 

Plasmid Construction 

Standard recombinant DNA methods were adopted in the construction of plasmid 
vectors (Sambrook et ai, 1989). The CP12 Hind lll-Not 1-1.7 kb promoter fragment was 
excised from pKS12P, filled in using Klenow fragment of DNA polymerase I and ligated into 
the Sma I site of the Agrobacterium Ti vector pTAKl containing the E.coli uidA gene 
encoding ^-glucuronidase (Jefferson et al„ 1987), to produce a pTAKCP12 binary vector. 
In the same way CP25 Hind Ul-BamH 1-3.7 kb and CP66 Bamti. 1-1.9 kb promoter fragments 
were excised from pKS25P and pKS66P respectively, and ligated into pTAKl cut with the 
same enzymes to produce pTAKCP25 and pTAKCP66 binary vectors. All constructs were 
transformed into E. Coli strain DH5a as an intermediate host for the vectors construction. The 
structure of the resultant chimeric reporter gene constructs was verified by PCR, restriction 
digest and sequence analysis. Figure 23 shows a schematic of the constructs for plant 
transformation. 
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Plant Transformation 

Plasmids pTAKCP12, pTAKCP25 and pTAKCP66 were transferred into 
Agrobacteriun tumefaciens LBA4404 using the freeze/thaw method described by Holsters et 
al. (1978). Tobacco (Nicotiana tabacum var. Samsun) transformants were produced by the 
5 leaf disc method (Bevan, 1 984). Shoots were regenerated on medium containing 1 00mg/l 
kanamycin and 200 mg/1 carbenicillin. After rooting, plantlets were transferred to the 
glasshouse and grown under 16 h light/ 8 h dark conditions. 



Results 





12CP plants 


25CP plants 


66CP plants 


# shoot taken 


440 


250 


404 


# shoot rooting 


130 (30%) 


82 ( 33%) 


120 (30%) 


# shoot subdivided 


100 


82 


100 


# shoot re-rooting 


84 (84%) 


58 (70%) 


71 (71%) 



PRIMARY TRANSFORMANT ANALYSIS 

Objective 

Primary transformants were first analysed by polymerase chain reaction (PCR) in 
order to reduce the number of plants to analyse and to make sure they contained the intact 
15 promoter-reporter gene cassette. 

Since promoters can be deregulated in callus, GUS analysis were carried out on calli 
from transformation to make sure the promoters are not active at this stageas this might have 
had a deleterious effect on future transformations efficiency depending on the nature of the 
transgene (e.g. if bamase ribonuclease gene is driven by one of these promoters). Promoter 
20 activity was also assessed in young leaves from primary transformants to confirm the absence 
of ectopic expression at this stage. 

Polymerase chain reaction 

Genomic DNA for PCR analysis of transgenic plants was prepared according to 

Edwards et al. (1992). Plant extracts DNA (2.5 ul) was amplified [hot start at 80°C ;(94°C, 1 
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min; 63°C, 1 min; 72°C, 1 min) x 35 cycles; ( 72°C, 7 min) x 1 cycle] using 2 U of Taq 
polymerase (Gibco BRL) in a 25 ul PCR reaction [200 uM dNTPs; 3 mM MgC12; 1 uM 
oligos; IX PCR buffer]. 

Results 

A total of 37 individual transformants per class were randomly picked from in-vitro 
culture 12CP, 25CP and 66CP explants and analysed with 2 sets of primers. The first set 
contained one primer specific to the 5' end of the NOS terminator of the NPTII gene 
(NOSTER1) and a reverse-primer specific to the 5'end of the cloned promoters (CYSGE12R, 
CYSGE25R or CYSGE66R). The second set contained one primer specific to the 3 '-end of 
the promoters (CYSGE12C, CYSGE25C or CYSGE66C) and a reverse-primer specific to 
the 5' portion of the GUS gene (GUS1R). A total of 34 explants was found to be double 
PCR positive for class 6 (94% of the plants tested), while for class 1 and class 2 only 27 
plants gave the expected result (73% of the plants tested). Plants containing the intact 
cassette were transferred to the glasshouse and self-pollinated. 



OLIGOS 


SEQUENCE ( 5'-> 3' ) 


CYSGE12RT 


GGG TTC TTC TGG GTA GCA AAC TG 


CYSGE25RT 


ACT TCA CGT TCT GAA TCT CAT CGA A 


CYSGE66RT 


GGG CCA GAA TGC GGA TTT TAC TAA 


GUS1R 


CGC TTT CCC ACC AAC GCT GAT C 


NOSTER1 


TTGAAT CCT GTT GCC GGT CTT GC 



GUS enzyme assays 

Fluorometric assays for GUS activity performed with the substrate 4- 
methylumbelliferyl-D-glucuronide (Sigma) were carried out using a Perkin-Elmer LS-35 
fiuorometer (Jefferson et al, 1987). Protein concentration of tissue homogenates were 
determined by the Bio-Rad protein assay (Bradford, 1976). 

Results 

GUS assays were carried out for each class on 20 regenerating calli resulting from the 
transformation process. Tobacco extracts from wild type leaves and callus as well as from 
leaves from 35S-GUS transgenic plants were used respectively as negative and positive 
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controls. As shown in Figure 24, no significant GUS activity could be detected in calli 
compared to the levels presents in leaves from the 35S-GUS plants. 

Figure 25 shows a preliminary assessment of the levels of GUS activity in young 
leaves from primary transformants of each class. Results indicate that the promoters are not 
5 active at this stage. This is a confirmation of the northern blot results obtained for class 2 and 
6 but contradicts those for class 1 (Figure 6). Class 1 CP showed some expression in leaves 
but it is thought to be due to a cross-hybridisation problem due to the nature of the probe. 
The GUS result seemed to confirm the latest hypothesis. 

ANALYSIS OF SEGREGATING POPULATIONS 

10 Objective 

The objective was to select for each class, four GUS expressing lines ranging from 
low expressors to high exprcssors, preferably from lines with a single locus insertion of the 
transgene as this facilitates the comparisons between lines. The number of loci in the primary 
1 5 transformants is estimated by the segregation of the NPTII (kanamycin-resistance) gene in the 
progeny. 

Segregation test 

Seeds were sterilised in 10% bleach for 15 min. After several washes in sterile water 
20 around 150 seeds were sown on Yi MS medium (2.3 g/1 MS salt, 1 .5% sucrose, 0.8% 

Bactoagar, pH 5.9) containing 100 mg/1 kanamycin. Seeds were grown.for three weeks at 
26°C with 16 hours/8 hours light/dark prior to scoring. If the primary transformants 
contained one copy of the transgene, the expected ratio for kan R to kan s seeds was 3 to 1 
(although in very rare cases one locus could possibly contain several transgenes). 

25 Results 

A substantial number of plants are showing a petaloidie phenotype in which a variable 
proportion of the flowers in a plant have one or more normal stamens replaced by petals. 
Some of these plants were so badly affected that we could not recover any seeds. 

The table below summarises the genetic data for the primary transformants. 
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12CP plants 


25CP plants 


66CP plants 


# lines in glasshouse 


27 


27 


34 


# lines giving seeds 


22 (81%) 


22 (81%) 


29 (85%) 


# petaloidie phenotype 


13 (48%) 


11 (41%) 


5 (15%) 


# single loci insertion 


1 1 (50%) 


9(41%) 


12(41%) 



Preliminary time course experiment 

5 The northern blot results indicated an accumulation of CP mRNA in oil seed rape at 2 

to 3 days after seeds imbibition (DAI). However, the heterologous expression in tobacco may 
differ from the endogenous expression in oil seed rape due to differences in physiology and 
transcription machinery. Furthermore, the activity of the promoter was indirectly analysed 
through a reporter protein, which delays the detection. So, in order to work out at which 
10 time point all the Fl generations should be analysed, the time course of GUS expression from 
each promoter had to be established. 

Results 

The experiment was carried out on 2 random lines per class as well as on wild type 
15 and 35S-GUS control lines. For each time point, 40 seeds of lines 12CP5, 12CP14, 25CP8, 
25CP13, 66CP8 and 66CP76, as well as the controls, were grown at 26°C with 16 hours/8 
hours light/dark, on plates containing 1/2 MS media. Seedlings were sampled at 0, 2, 4, 6, 8, 
10, 29 and 36 DAI and stored at -70°C. At time point 0 to 10 DAI, the 10 biggest seedlings 
were collected while only 5 were taken at 29 and 36 DAI which increased the variability 
20 within these samples. 

In tobacco seedlings grown under the conditions described above, three lines out of 6 
showed an induction of GUS expression during germination (Figure 26). The activity peaked 
at around 30 DAI, although the protein accumulation clearly started at 8 DAI. Thus, the Fl 
generations should be assessed at 14 and 28 DAI to identify a range of GUS expressors. The 
25 maximum of activity was about 5% of 35S-GUS in the best expressing line (12CP5) although 
the relatively high level of expression in leaves suggests that this might be due to a position 
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effect of the transgene. Normal levels of expression are more likely to be around 1 % of 35S- 
GUS. 

Identification of high GUS-expressing lines during germination. 
5 Results 

Seedlings were grown on 1/2 MS media supplemented with 100 mg/1 kanamycin in 
the conditions described previously. Five seedlings were harvested at 0 (dry seeds), 14 (2 
expanded leaves) and 28 DAI (4 expanded leaves), pooled and assessed in duplicates as 
described previously. 

10 Figures 27, 28, and 29 summarise the expression levels for class 1 , 2 and 6 

respectively. These preliminary data suggest that the promoters are expressed in a seedling- 
specific manner in tobacco. As expected from the RNA study in oilseed rape the levels of 
expression are low. Class 2 promoter fragment is more active than class 1 at this stage, while 
class 6 gives extremely low levels of expression. 

15 

GUS histochemical detection 

GUS histochemical staining of whole seedlings was achieved by vacuum infiltration 
with a solution containing 1 mM X-gluc (5-bromo-4-chloro-3-indolyl -D-glucuronide, 100 
20 mM NaP04 pH 7.5, 10 mM EDTA, 0.5 mM K3Fe(CN)6, 0.5 mM K4Fe(CN)6, 0.1% Titon 
X-100, 0.1% DMSO. After 12 h incubation at 37°C intact seedling were photographed. 
Alternatively, stained seedlings were vacuum infiltrated with Tissue-Tek OCT compound 
prior to freezing in liquid nitrogen. A bright cryostat microtome (model 5030) was used to cut 
20 |im sections at -23°C. 

25 Other modifications of the present invention will be apparent to those skilled in the art 

without departing from the scope of the invention. 
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Claims 

1 . A promoter comprising the DNA sequence of an oil seed rape cysteine protease gene 
promoter of class 1, 2 or 6. 

2. A promoter comprising at least part of the DNA sequence as shown in Figure No. 19, 
or at least part of a sequence that has substantial homology therewith, or a variant 
thereof. 

3. A promoter comprising at least part of the DNA sequence as shown in Figure No. 20 
or at least part of a sequence that has substantial homology therewith, or a variant 
thereof. 

4. A promoter comprising at least part of the DNA sequence as shown in Figure 2 1 or at 
least part of a sequence that has substantial homology therewith, or a variant thereof. 

5. A promoter having the characteristic motif or features of a promoter of any one of the 
preceding claims. 

6. A recombinant DNA construct comprising the promoter of any preceding claim 
operably linked to a gene of interest. 

7. A recombinant DNA construct functional in a plant comprising a disrupter gene 
encoding a product capable of disrupting cell function, and a promoter according to 
any preceding claim for the disrupter gene, the disrupter gene being functionally linked 
to and controlled by an externally regulatable gene control region which includes a 
promoter which is inducible by the external application of a chemical inducer. 

8. A recombinant DNA construct according to claim 7 wherein the inducible promoter is 
functionally linked to and controls a repressor protein gene and in which the disrupter 
gene promoter includes an operator sequence which is recognised by the repressor 
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protein, so that in the presence of the inducer the repressor protein is produced which 
interacts with the operator sequence thereby disabling the second promoter and 
inhibiting expression of the disrupter gene. 

9. A recombinant DNA construct according to claim 7 or claim 8 wherein the disrupter 
gene is a nucleotide sequence, which is in sense orientation to an endogenous plant 
gene which is essential to plant development or a gene conferring a desired 
characteristic on the plant, or comprises a partial sense sequence of the endogenous 
plant gene. 

10. A recombinant DNA construct according to claim 7 or claim 8 wherein the disrupter 
gene is a nucleotide sequence, which is in antisense orientation to an endogenous plant 
gene which is essential to plant development or a gene conferring a desired 
characteristic on the plant. 

11. A recombinant DNA construct according to claim 9 or claim 10 wherein the 
endogenous plant gene is essential to seed germination or early seedling development. 

12. A recombinant DNA construct according to any one of claims 7 to 1 1 wherein the 
externally regulatable gene control region is a chemically inducible gene promoter 
sequence from the glutathione S-transferase system, the Ale system or the ecdysone 
system. 

13. A recombinant DNA construct according to any one of claims 8 to 13 wherein the 
repressor protein gene encodes a bacterial repressor. 

14. A recombinant DNA construct according to claim 13 wherein the repressor protein 
gene encodes the lac repressor or a repressor used by 434, P22 or lambda- 
bacteriophages. 
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15. A recombinant DNA construct according to any one of claims 7 to 1 4 wherein the 
disrupter gene or disrupter promoter contains a "pseudo-operator". 

16. A recombinant DNA construct according to any one of claims 7 to 15 wherein the 
disrupter gene is a cytotoxic gene. 

17. A recombinant DNA construct according to any one of claims 7 to 1 5 wherein the 
disrupter gene encodes a recombinase or transposase adapted to excise a nucleotide 
sequence flanked by recombinase recognition sequences. 

18. DNA comprising at least part of the sequence shown in Figure 12 or at least part of a 
sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 

19. DNA comprising at least part of the sequence shown in Figure 13 or at least part of a 
sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 

20. DNA comprising at least part of the sequence shown in Figure 14 or at least part of a 
sequence that has substantial homology therewith or a variant thereof and which codes 
for a cysteine protease. 

2 1 . DNA comprising at least part of the sequence shown in Figure 1 5 or at least part of a 
sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 

12. DNA comprising at least part of the sequence shown in Figure 16 or at least part of a 
sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 
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23. DNA comprising at least part of the sequence shown in Figure 17 or at least part of a 
sequence that has substantial homology therewith or a variant thereof, and which 
codes for a cysteine protease. 

24. A recombinant DNA construct functional in a plant comprising the DNA of any one of 
claims 18 to 23 operably linked to a promoter. 

25. A recombinant DNA construct according to any one of claims 6 to 1 7 and 24 wherein 
the construct is capable of being expressed in the tissue or tissues of a germinating 
seedling or a developing grain or a plant when the construct is integrated , preferably 
stably integrated, within the grain's or seedling's or plant's genomic DNA. 

26. An expression system for the tissue or tissues of a plant material, the expression 
system comprising 

a gene of interest fused to a gene promoter as defined in any one of claims 1 to 5 
wherein the expression system is capable of being expressed in the tissue or tissues of 
the plant material. 

27. An expression system according to claim 26 wherein the expression system is for at 
least the tissue of a germinating seedling or developing grain or plant (eg in the root, 
cotyledons, leaves and stem). 

28. An expression system according to claim 26 or claim 27 wherein the expression 
system is integrated, preferably stably integrated, within a germinating seedling's 
genomic DNA or a developing grain's genomic DNA or a plant's genomic DNA. 

29. An expression system comprising a construct according to any one of claims 6 to 1 7 
and 24. 
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30 A recombinant plant genome comprising a promoter of any one of claims 1 to 5, DNA 
of any one of claims 18 to 23, a recombinant DNA construct of any one of claims 6 to 
17 or 24 or an expression system according to any one of claims 25 to 29. 

31. A plant, plant seed or plant cell having a recombinant plant genome of claim 30. 

32. Protected plant germplasm comprising a plant comprising a recombinant DNA 
construct of any one of claims 6 to 17 or 24. 

33. A plant or seed which is incapable of growing to maturity comprising a recombinant 
DNA construct of any one of claims 6 to 17 or 24. 

34. The use of a gene promoter as defined in any one of claims 1 to 5 to induce expression 
of a gene of interest when fused to the gene promoter in the tissue or tissues of a plant 
material. 

35. The use according to claim 34 wherein the gene promoter is used to induce expression 
of a gene of interest when fused to the gene promoter in at least the tissue or tissues of 
a germinating seedling or a developing grain or a plant (eg in the root, cotyledons, 
leaves and stem). 

36. A promoter, a construct or an expression system substantially as hereinbefore 
described with reference to any one of Figures 12 to 17 and 19 to 21. 
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